Towards Portable Performance for Explicit Hydrodynamics Codes

نویسندگان

  • A. C. Mallinson
  • D. A. Beckingsale
  • W. P. Gaudin
  • J. A. Herdman
  • S. A. Jarvis
چکیده

Significantly increasing intra-node parallelism is widely recognised as being a key prerequisite for reaching exascale levels of computational performance. In future exascale systems it is likely that this performance improvement will be realised by increasing the parallelism available in traditional CPU devices and using massively-parallel hardware accelerators. The MPI programming model is starting to reach its scalability limit and is unable to take advantage of hardware accelerators; consequently, HPC centres (such as AWE) will have to decide how to develop their existing applications to best take advantage of future HPC system architectures. This work seeks to evaluate OpenCL as a candidate technology for implementing an alternative hybrid programming model, and whether it is able to deliver improved code portability whilst also maintaining or improving performance. On certain platforms the performance of our OpenCL implementation is within 4% of an optimised native version.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards Performance Portable GPU Programming with RAJA

High-performance machines of today are already heterogeneous in nature with traditional multicores and accelerators such as GPUs and Xeon PHIs. It is a challenge to program these machines in a performance portable way with code that is maintainable across large production applications. RAJA is a recently introduced C/C++ programming approach targeting fine-grained parallelism in loops that is i...

متن کامل

Efficiency of Anti-Hourglassing Approaches in Finite Element Method (TECHNICAL NOTE)

one of the simplest numerical integration method which provides a large saving in computational efforts, is the well known one-point Gauss quadrature which is widely used for 4 nodes quadrilateral elements. On the other hand, the biggest disadvantage to one-point integration is the need to control the zero energy modes, called hourglassing modes, which arise. The efficiency of four different an...

متن کامل

Memory Throttling on BG/Q: A Case Study with Explicit Hydrodynamics

Power and energy efficiency are major concerns in future supercomputing systems. We expect that applications will be constrained to operate under a power budget and achieving the expected levels of performance will be challenging. Understanding how power is consumed by an application throughout its different phases will be necessary to shift power to those resources on the critical path. In thi...

متن کامل

A fast and explicit algorithm for simulating the dynamics of small dust grains with smoothed particle hydrodynamics

We describe a simple method for simulating the dynamics of small grains in a dusty gas, relevant to micron-sized grains in the interstellar medium and grains of centimetre size and smaller in protoplanetary discs. The method involves solving one extra diffusion equation for the dust fraction in addition to the usual equations of hydrodynamics. This “diffusion approximation for dust” is valid wh...

متن کامل

CloverLeaf: Preparing Hydrodynamics Codes for Exascale

In this work we directly evaluate five candidate programming models for future exascale applications (MPI, MPI+OpenMP, MPI+OpenACC, MPI+CUDA and CAF) using a recently developed Lagrangian-Eulerian explicit hydrodynamics mini-application. The aim of this work is to better inform the exacsale planning at large HPC centres such as AWE. Such organisations invest significant resources maintaining an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013